Moonbounce raises $12 million to scale its AI control engine, which translates content moderation policies into consistent AI behavior. The startup, founded by a former Facebook insider, aims to keep AI governance predictable as AI systems take on more content moderation responsibilities.
Learn to build a real-time hate speech detection system using Python and transformer models, similar to Penemue's AI platform for identifying online hate and digital violence.
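A minimal sketch of the kind of classifier such a tutorial might start from, using the Hugging Face transformers library. The checkpoint, label string, and threshold below are illustrative assumptions, not the article's (or Penemue's) actual stack.

```python
# Minimal hate speech detector built on a pretrained transformer.
# Checkpoint, label name, and threshold are illustrative assumptions.
from transformers import pipeline

detector = pipeline(
    "text-classification",
    model="facebook/roberta-hate-speech-dynabench-r4-target",
)

def flag_message(text: str, threshold: float = 0.9) -> bool:
    """Return True when the model labels the text hateful with high confidence."""
    pred = detector(text, truncation=True)[0]  # e.g. {"label": "hate", "score": 0.97}
    return pred["label"] == "hate" and pred["score"] >= threshold

# A "real-time" system would wrap this in a stream consumer;
# here we just score a couple of messages synchronously.
for msg in ["Hope you have a great day.", "I can't believe people like you exist."]:
    print(msg, "->", flag_message(msg))
```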
Learn to build an AI content analysis tool that can detect potentially problematic language in chatbot responses, prompted by the legal issues surrounding Grok and former Swiss president Karin Keller-Sutter.
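As a rough illustration of such an analysis tool, here is a rule-based first pass over a chatbot response. The patterns and risk categories are invented for the example; a real system would layer named-entity recognition and a trained classifier on top of keyword rules.

```python
import re

# Illustrative patterns only; a production tool would combine NER
# with a trained classifier rather than rely on keyword matching.
RISK_PATTERNS = [
    (re.compile(r"\b(criminal|fraudster|corrupt)\b", re.I), "defamation-risk"),
    (re.compile(r"\b(kill|attack|eliminate)\b", re.I), "violence-risk"),
]

def analyze_response(response: str) -> list[dict]:
    """Scan a chatbot response and return flagged terms with a risk category."""
    findings = []
    for pattern, category in RISK_PATTERNS:
        for match in pattern.finditer(response):
            findings.append({
                "category": category,
                "term": match.group(0),
                "offset": match.start(),
            })
    return findings

print(analyze_response("The minister is corrupt and should be eliminated."))
```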
This article explains how automated AI systems for content moderation can produce erroneous takedown notices, examining their technical architecture, trade-offs, and legal implications.
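The core trade-off the article examines can be sketched in a few lines: an automated pipeline issues a notice whenever a classifier's confidence clears a threshold, so the threshold directly sets the false-positive rate. The data class, scores, and IDs below are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Detection:
    content_id: str
    score: float  # classifier confidence that the content violates policy

def issue_takedown_notices(detections: list[Detection], threshold: float) -> list[str]:
    """Return the content IDs that receive an automated takedown notice."""
    return [d.content_id for d in detections if d.score >= threshold]

# Hypothetical scores: item "b" is a borderline false positive.
queue = [Detection("a", 0.97), Detection("b", 0.55), Detection("c", 0.12)]
print(issue_takedown_notices(queue, threshold=0.5))  # ['a', 'b'] -> erroneous notice for 'b'
print(issue_takedown_notices(queue, threshold=0.9))  # ['a']     -> borderline violations slip through
```

Lowering the threshold catches more real violations but mechanically generates more erroneous notices, which is where the legal exposure the article discusses comes from.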
Meta's Oversight Board warns that Community Notes are ill-equipped to handle the growing threat of AI-generated disinformation, especially in vulnerable regions.
OpenAI has discontinued its erotic mode for ChatGPT, following a pattern of abandoning experimental features amid regulatory pressure and ethical concerns. The move reflects the company's strategic shift toward prioritizing safety and responsible development.
Meta has launched new AI content enforcement systems to improve platform safety and reduce reliance on third-party vendors. The company claims these tools will detect more violations with greater accuracy and respond more quickly to real-world events.
Meta's deepfake detection methods are insufficient for handling misinformation during armed conflicts, according to its own Oversight Board. The board is calling for a major overhaul of how the company identifies and surfaces deepfake content.